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1 .  Introduction 

\  The  main  ideas  in  selecting  the  best  populations  meeting  some  prescribed 
optimality  criterion  have  been  mooted  originally  by  Bechchofer  (1954)'  and 
Gupta  (1956)  and  the  subject  has  gone  from  strength  to  strength  by  several 
contributions  by  several  statisticians  over  the  last  three  decades.  In  this 
paper,  the  selection  problem  is  tackled  from  a  decision  theoretic  point  of  view. 

In  selecting  the  best  population,  we  take  into  account  the  cost  of  sampling 
and  the  penalties  for  taking  a  wrong  decision.  This  kind  of  approach  has 
been  promulgated  by  Somerville  (1954)  and  Ofosu  (1972).  We  are  basically 
interested  in  selecting  the  best  Pareto  population  following  the  lead  given 
by  Somerville  (1954)  and  Ofosu  (1972).  The  Pareto  probability  model  has  been 
found  useful  in  a  wide  variety  of  contexts.  Pareto  (1927)  proposed  this  model 
to  study  the  distribution  of  incomes  in  various  societies  for  comparison.  In 
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medlcal  circles,  this  has  been  used  as  a  model  for  the  remission  rate  of 
discharged  psyciatric  patients  (See  Fox  and  Kraemer  (1971). )as  a  survival 
model  for  cardiac  patients  waiting  for  a  heart  transplant  operation, (See 
Wingo  (1979).).  In  the  context  of  economics  and  marketing,  this  has  been 
used  as  a  model  for  the  distribution  of  property  values,  business  mortality, 
migration  of  workers,  size  of  cities  and  firms,  and  consumer  prices.  See 
Steindl  (1965)  and  Koutrouvelis  (1981).  Ofosu  (1972)  has  worked  out  minimax 
sample  sizes  under  a  certain  penalty  function  in  the  selection  of  the  best 
Gamma  population.  We  feel  that  there  is  a  gap  in  one  of  the  steps  involved 
in  the  derivation  of  minimax  sample  sizes.  In  this  paper,  we  consider  four 
different  types  of  penalty  functions  including 

the  one  considered  by  Ofosu  (1972)'*.  Under  three  of  these  penalty  functions, 
we  derive  the  minimax  sample  sizes.  The  maximum  of  the  resultant  loss  functions 
is  explicitly  derived  overcoming  the  difficulty  faced  by  Ofosu  (1972).  We 
will  elaborate  on  this  at  the  appropriate  juncture. 

2.  General  formulation 

In  this  section,  we  present  general  ideas  in  selecting  the  best 
poulation  from  a  decision  theoretic  point  of  view.  In  the  next  section, 
we  specialize  in  the  Pareto  distribution. 

There  are  k  populations  wj,  itj.  •••  .  under  study.  Let 
X^.  denote  an  observation  selected  at  random  from  the  i-th  population  ir., 

1  =  l,2,***,k.  The  probability  law  governing  the  generation  of  X.'s  is 
described  by  a  probability  density  function  f(";0.)  which  depends  on  an 


unknown  parameter  ,  i  =  l,2,***,k.  We  assume  that  the  functional  form  of 
f  is  completely  specified  and  it  is  the  same  for  all  the  populations.  The 
only  unknown  quantity  that  enters  the  density  function  is  9^.  and  the  set 
of  all  possible  configurations  (6j,e2,***  ,0j^)  is  denoted  by  0  .  We  declare 
a  population  to  be  the  best  if  its  9-value  is  the  largest.  If  two  or  more 
-values  are  equal  and  largest,  we  adopt  a  well  defined  convention  in 
declaring  one  of  the  corresponding  populations  to  be  the  best.  One  such 
commonly  adopted  convention  is  the  following.  If  9.  and  9.  are  equal 
and  the  largest,  the  population  tt.  is  declared  to  be  the  best  if  j  >  i. 
Conventions  are  needed  only  in  the  calculation  of  probabilities  of  certain 
events  in  the  selection  problem.  In  reality,  if  9.  and  9.  are  equal  and 

^  w 

the  largest,  we  could  regard  both  the  populations  it.  and  it.  to  be  the 

*  J 

best. 

One  could  also  define  'the  best  population'  to  be  the  one 
whose  9-value  is  the  least.  The  treatment  of  the  selection  problem  within 
the  purview  of  this  definition  is  analogous  to  the  one  we  are  going  to 
develop  for  the  above  definition. 

A  selection  problem  basically  consists  of  two  components. 

(i)  Draw  a  random  sample  of  size  n^  from  population  it.,  i  =  1,2,*** 

(ii)  Develop  a  statistical  procedure  R  built  on  ^2  ^  ^  ’’k 

observations  which,  once  the  data  are  given,  clearly,  declares  the  best 
population. 

Introducing  a  good  statistical  procedure  R  to  select  the  best 
population  is  not  a  difficult  job  for  many  probability  models.  The  following 


is  a  natural  procedure.  Let  Y2,  •••  .  be  n  independent  identically 
distributed  random  variables  with  common  probability  dc  ity  function  f(*;0), 
e  unknown.  Choose,  if  possible,  a  minimum  variance  unbiased  estimator  of  e 
based  on  Yj,  Y2,  •••  ,  Yj^.  If  minimum  variance  unbiased  estimator  of  0 
does  not  exist,  choose  some  decent  estimator  of  0  based  on  '^2* 
which  has  good  asymptotic  properties.  Let  us  denote  the  chosen  estimator  by 
T(Yj^,Y2,*’*  ,Y|^).  (In  the  Pareto  case  we  are  going  to  discuss,  a  natural 
estimator  of  0  presents  itself.) 

Statistical  Procedure  R 

Suppose  the  data  from  the  k  populations  are  arranged  as  follows. 

Population  ir^  ^2  •••*  Tr|^ 

^21  •••*  hi 

hz  hz  ***’  ’^kz 

Data 


X  ^2n2  ••••  S 

Let  T.  =  j  =  l,2,-”,k. 

Population  is  the  best  if  T.  >  for  every  j  i  • 

For  simplicity,  here,  we  assume  that  each  T.  has  a  continuous  probability 
density  function.  If  two  or  more  T.  values  are  equal  and  the  largest,  the 
population  with  largest  suffix  i  is  declared  to  be  the  best  (our  convention) 
Of  course,  this  event  occurs  with  probability  zero. 


The  heart  of  the  matter  in  selection  problems  is  the  choice  of 


the  sample  sizes  n^,  n2»  •••  >  We  need  to  introduce  an  optimality 
criterion  so  that  the  sample  sizes  chosen  are  optimal  according  to  the 
criterion  proposed.  To  simplify  the  problem,  we  decide  in  advance  that  we 
intend  to  select  samples  of  the  same  size  form  each  population.  Let 
nj  =  n2  =  •••  =  n^  =  n. 


Following  Somerville  (1954)  and  Ofosu  (1972),  we  proceed  as 
follows.  We  adopt  the  statistical  procedure  R  described  above  to  select 
the  best  population.  No  statistical  procedure  is  infallible.  It  might 
declare  a  wrong  population  to  be  the  best.  Let  e^,  02.  •••  » 

be  a  configuration  of  the  populations  ttj,  Tr2»  •••  .  respectively. 

Let  <  9^2)  1  *•*  i  ®(k)  arrangement  of  e^,  02»  •••  »  6|^ 


in  increasing  order  of  magnitude.  Let  be  the  population  whose  parameter 

s3 


value  is  0^jj  ,  j  =  l,2,***,k.  If  two  or  more  0-values  coincide,  by  the 


convention  mentioned  earlier,  the  corresponding  i.'s  are  taken  to  be  in 


increasing  order  of  magnitude.  ’2*  ’  ^k  merely  a  permutation 

of  l,2,»**,k  which  depends  on  the  configuration  (0j,02.*** .e|^)  in  0. 

Let  the  statistic  T.  =  T(X^.  j,X.  2.**’.X^  „)  correspond  to  the  population 


J  J  0  J 

.  j  =  l,2,***,k.  According  to  the  configuration  (9j,92.***  .Sj^)  in  0  , 

.  is  the  best  population.  Under  the  given  configuration  (0j^,02.***  .9|^)  in 


0,  the  statistical  procedure  R  can  commit  k-1  different  types  of  wrong 
decisions.  The  list  is  given  below. 


Wrong  decision  ;  Type  j  :  ir.  is  declared  to  be  the  best. 

J  - - - - 1 

(j  =  1.2.--.,k-l) 

Correct  decision  :  is  declared  to  be  the  best. 
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In  order  to  calculate  the  probabilities  of  the  above  events,  we 
assume  that  the  probability  density  function  of  the  statistic  T  = 
T(Yj^,Y2,***  ,Y^) ,  where  Y^,  ^2*  ***  »  independent  identically 

distributed  random  variables  with  common  probability  density  function 
f(*;0),  is  of  the  continuous  type.  Under  this  assumption,  the  random 
variables  ^2’  **’  ’  distinct  with  probability  one  under  every 

configuration  (e2^,92i***  ,0|()  in  0.  Let  g(*;0;n)  be  the  probability 
density  function  of  T  =  T(Y^,Y2,*** ,Y|^)  and  G(*;0;n)  its  distribution 
function.  Let 


Pj(01,02,*** ,0k;n)  =  Prob(R  takes  a  wrong  decision  of  Type  j) 

=  %  ...  ft  (Ti  >  Ti  every  r  ^  j) 

’®k  V 

«  k 

=  ;  n  G(x;0(^j;n)  g(x;0(j.j;n)  dx, 

rj'j 


and 


j  =  l,2,*»*,k-l. 


*^k^®l’®2’**’ =  Prob(R  takes  the  correct  decision) 


=  Pr 


®l*®2’'**’®k  'k 


(T,.  >  T.  for  r  =  l,2,***,k-l) 

r 


k-1 


/  H  G(x;0^^j;n)  g(x;0^l^j;n)  dx  . 


r=l 


Cost  of  sampling 

The  cost  of  selecting  a  random  sample  of  size  n  from  each  population 


has  to  be  taken  into  account  in  the  determination  of  an  optimal  sample  size  n 
We  assume  that  the  cost  function  to  be  of  the  form 
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C(n)  =  Cq  n  =  1,2,  •••  , 

where  c^,  Cj  and  d  are  nonnegative  constants,  and  are  measured 
in  the  same  units,  and  represents  fixed  administrative  costs  involved 
in  setting  up  a  sampling  plan.  If  d  =  1,  the  cost  of  taking  additional 
samples  rises  linearly  with  n.  If  d<  1,  the  rise  in  the  cost  does  not 
increase  relatively  with  increasing  sample  sizes.  If  d  >1,  it  will 
become  more  and  more  expensive  to  take  additional  samples. 

Penalty  for  wrong  decisions 

Let  ***’\^  denote  penalty  for  taking  a  wrong  decision 

of  Type  j  which  is  measured  in  the  same  units  as  those  of  c^  and  Cj  for 
j  *  l,2,...,k-l  and  for  every  configuration  ® 

next  section,  we  discuss  several  choices  of  penalty  functions.) 


Loss  function 


For  each  configuration  ^^l*^2**’*’^k^  ^ 

sample  size  n  ^  1,  we  assume  that  the  loss  function  L  to  have  the 
following  structure. 


0 

2’ 


Cost  of  sampling  +  Average  penalty  for  wrong 
decisions 


k-1 


C(n)  +  ^  (Penalty  for  wrong  decision  of  Type  j)x 

j=l 

Pr(R  commits  wrong  decision  of  Type  j) 


C( 


k-1 


nl+  rWf®®  9\pf0  9  9 

11/  ^  i,  "j*  1*  2****’  k^  1*  2’*’*’  k’*  ^  * 

J  1 


Decision  theoretic  formulation 


We  now  identify  the  state  space  to  be  0  and  the  action  space  to 


be  N  =  {1,2,3,  •••  }  ,  the  set  of  all  possible  sample  sizes,  and  the  loss 
function  L  is  the  one  described  above.  It  is  defined  on  the  cartesian  product 
space  0  X  N.  The  minimax  sample  size  n  minimizes 

max  L(0;m) 

-v 

0e0 

over  all  m  in  N.  In  the  next  section,  we  specialize  the  case  of  Pareto 
distribution. 

3.  On  selecting  the  best  Pareto  population 

The  probability  density  function  of  a  Pareto  distribution  is  given  by 
A  A  *♦'1 

f(x;0;m)  =  0m  /x  ,  m  x  <  ®  , 

where  m  >  0  and  0  >  0  are  the  parameters  of  the  model.  We  recall  some 
distribution  theory  concerning  this  model.  For  details,  see  Johnson  and  Kotz 
(1970).  Let  independent  identically  distributed  random 

variables  with  common  probability  density  function  f  given  above.  Let 

m  =  min  X. 
lo^n  ^ 

and 

0  =  (log((  ! 

i=l  ’ 

A  A 

Then  m  and  0  are  jointly  sufficient  for  m  and  9.  Further, 

2n0/9  ^  X2n* 

2 

where  X2n  chi-square  distribution  with  2n  degrees  of  freedom. 


The  problem  to  which  we  address  ourselves  in  this  section  is  the 
following.  The  j-th  population  w.  has  Pareto  distribution  with  parameters 


'  V 


'.y  V  ■■  ■  r- T 


m.  and  e.,  both  unknown,  j  =  l,2,***,k.  We  want  to  select  that  population 

«1  tl 

TT .  for  which  9.  is  the  largest.  As  expostulated  earlier,  we  have  to 
•1  J 

strike  at  a  decent  estimator  T  of  e  based  on  n  independent  observations 
from  a  Pareto  distribution  with  parameters  m  and  0  in  order  to  describe 
the  statistical  procedure  R  in  the  selection  of  the  best  population.  From 
the  deliberations  carried  out  above,  a  clear  choice  emerges.  Let 


=  9  = 


(  n  X,) 


1/n 


log 


i=l 


1 


mm  X. 


l<i<n 


1 


-1 


(3.1) 


The  parameter  space  0  in  this  case  identifies  as  the  positive  orthant  of 
the  k-dimensional  euclidean  space,  i.e., 

0  =  1  9^  >  0  for  all  i}  . 

Statistical  Procedure  R 


F 


r 


Let  Xj^,Xj2i***>Xj^  be  a  random  sample  of  size  n  from 
population  it.,  j  =  l,2,**’,k. 

Declare  population  ir.  (i  =  1,2,»»»  ,k)  to  be  the  best  i f 

-  T,  >  T.  =  T(X„.Xj^.--.Xj„) 

for  all  j  f  i . 

where  the  statistic  T  is  given  by  the  formula  (3.1). 


The  error  probabilities  in  this  case  work  out  as  follows. 

"  ^  ®frl 

P,-(9i  ,9p,'--,0.  in)  =  ;  n  (1  -  (-i— ^  x))  g2n(x)  dx. 


(3.2) 


for  j  =  l,2,***,k-l,  where  g2n(')  ''s  the  probability  density 

function  of  a  chi-square  distribution  with  2n  degrees  of  freedom  and 
G2n(*)  its  distribution  function. 


The  probability  that  R 


®  k-1 

Pi.(eT  »eof •••  fSi-jn)  =  /  n 

^  ^  ^  0  r=l 


takes  the  correct  decision  is  given  by 


At  this  juncture,  we  state  some  inequalities  concerning  the 
above  probability  for  future  use. 


Lemma  3.1  For  every  (9j,e2,***  .0|^)  in  0  , 


Pk^®r®2’***’®k’"^  1  /  (1  - 

and  equality  holds  in  the  above  if  = 

Proof.  Obvious. 


Lemma  3.2  Let  (0j,92»’‘*  iSi^)  e  0  .  Write  0^|^^  =  (Obviously,  r  ^  1.) 

Then 


Pk (® 1 »® 2  *  *  *  * k ’ ^ ^  ~  ^  (1  ”  G2p(x/r)) 

and  equality  in  the  above  holds  if  0j^  =  02  =  ••• 
Proof.  Easy  to  check. 


g2n(^) 

=  9|^_j  and  0 


re 


k-r 


(01.02 


Now,  the  loss  function  L  takes  the  following  form.  For 
,9|^)  in  0  and  n  ^  1, 


k-1 

,0k»^)  *  C(n)  +  Z  ^j^®l*®2* 


■»k> 


/ 

n 


k 

n  (1  - 

r=l 


0/ 


92n<’'> 


L(9i,92.’** 


In  order  to  find  the  minimax  sample  size,  we  need  to  maximize 


the  above  loss  function  over  0  for  every  fixed  n  ^  1.  We  consider  four 

different  types  of  penalty  functions  W.'s  and  examine  the  behaviour  of 

J 

the  loss  function  under  each  one  of  these  penalty  functions. 


Penalty  function  of  Type  1  :  Constant  penalty. 


Let  ,e2»***  ,9|^)  =  c,  a  constant  for  every 

in  0  and  j  =  (For  simplicity,  take  c  =  1.)  What, this 

means  is  that  whatever  may  be  the  type  of  wrong  decision,  the  same  amount 
of  penalty  is  imposed.  In  this  case,  the  loss  function  L  simplifies  to 


L(0j ,02»* •  *  f0|^;n) 


k-1 

C(n)  +  Z  p.(0,  ,0p,“*  ,0L;n) 
J  1  ^  K 


=  C(n)  +  1  -  p,^(0j,02.***  .0|^;n) 

for  every  (0^ ,02i* •  •  «0|^)  in  0  and  n  ^  1.  The  maximization  of  L 
over  0  can  easily  be  worked  out  for  every  fixed  n  ^  1.  For  this,  let 
us  introduce  the  following  subset  of  0. 


0Q  =  {(02 ,02»** *  »9|()  e  0  ;  02  ®  02  ”  ***  " 

Proposition  3.3  For  every  n  ^  1 , 

max  L(?;n)  =  max  L(e;n)  =  (k-l)/k. 

i\j 

0e0 

Proof.  First,  we  note  that  if  (02  »e2»’ *  *  ^  ^2’  *’*  ’ 

are  independently  identically  distributed.  Consequently,  Pj(02.62«*** »® 


1/k  for  every  j  =  l,2,***,k.  Therefore, 


E  Pj(ej.02.-**.e,^;n)  *  (k-l)/k 


for  every  (0j.e2.***  .ej^)  in  0^.  Also, 


k-1 


max  E  p.(a;n)  =  max  (1  -  p^(Qin)} 


i=l 

9e0 


1  -  min  p,^(0in) 


'Vi 

0e0 


.k-1 


by  Lemma  3.1, 


=  1  -  P|^(0j.0ii***.0^;n) 

=  1  -  (1/k)  =  (k.l)/k. 


This  completes  the  proof. 


As  a  consequence  of  the  above  proposition,  we  have  the 
following  results. 

Theorem  3.4  For  every  fixed  n  ^  1, 

max  L(0;n)  =  C(n)  +  (k-l)/k. 

^e0 

Corollary  3.5  Under  the  Penalty  function  of  Type  1,  the  minimax  sample 
size  is  n  =  1. 


Penalty  function  of  Type  2  :  Penalty  function  which  takes  into  account 


differences  between  the  best  and  the  second 
best  populations. 
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It  is  natural  to  Impose  penalties  for  wrong  decisions  based 
on  the  magnitude  of  the  parameters  involved.  A  critical  point  in  the 
selection  problem  is  the  times  at  which  the  second  best  population 
passes  as  the  best  population.  If  in  0  is  a  configuration 

of  the  parameters  of  the  k  populations,  the  penalty  for  declaring 
the  second  best  population  as  the  best  population  should  naturally 
depend  on  and  Some  natural  penalty  functions  are 


'^k-l^®l’®2’*'*’®k^  =  C2 


'(k-1) 


=  C2  log  - 


(k-1) 


where  c^  >  0  is  measured  in  the  same  units  as  those  of  c  and  c, . 
i  0  1 

There  are  other  types  of  wrong  decisions  too.  We  could  insist  that 
the  penalty  for  other  types  of  wrong  decisions  should  be  atleast 

”j(ei*92*“-»9k)  =  “k.l(9r92’**‘*9k^’ 

for  j  =  l,2,***,k-2,  where  d^,  d2,  •••  ,  dj^  ^  3*^®  constants  exceeding 
unity.  It  is  natural  to  take  ^  ^2  ^  •••  ^  d|^_2  embodying  the 
principle  that  the  more  extreme  the  type  of  wrong  decision  the  more 
severe  the  penalty  is.  In  what  follows,  we  take 


W|^_1  ( 9 2 ,0 2  •  *  *  * 


’(k-1) 


The  loss  function  then  becomes 


L(9,,0,,***,9.  ;n)  =  C(n)  +  E  c-  d.(log  .  ■  )  P-.(e,  ,e»,*** ,0.  ;n) 

12  k  j=l  2  J  9(k-l)  J  1  2 

for  all  (92,02»**' .9|^)  ^n  0  and  n  1 ,  where  d|^_2  =  1. 


Our  next  objective  is  to  maximize  the  loss  function  L  over  0 


for  every  fixed  n  ^  1.  We  were  unable  to  maximize  the  loss  function  explicitly 
in  its  full  generality  stated  above.  The  following  results  give  explicitly 
the  maximum  of  this  loss  function  over  0  in  the  case  dj  =  d^  =  •••  = 


Before  stating  the  relevant  results 


we  introduce  the  following  subset  of  0. 


01  =  {(0j.02.***.eb)  e  0  ;  9i  =00 . 0U_1  =  <^0 


'1  °2  °k-l 

for  some  r  >  1} 


k  '°k-l 


Lemma  3.6  For  every  fixed  n  >  1,  let 

k-1  0, 


p  =  max 


E  (log  a— •-^)  P. (01.00. •••.0kin)  , 
j=l  ®(k.l)  J  ^  ^  " 


where  the  maximum  is  taken  over  all  (0jt02.***.0|()  ®. 


and 


•"-I  ®?kl 

q  =  max  r  (log  A  ■ —  )  p.(0?,05,*** ,0K;n)  , 
j=l  ®(k-l)  J  ^  2  " 

where  the  maximum  is  taken  over  all  (0j * •  .0jj)  in 


Then 


k-2 


p  =  q  =  max  (log  r)  (k-1)  /  (1  -  G2p(x))  "  (1  -  G?n(xr))  ‘lx 


r>l 


’2n'"'  V2n' 


.k-1 


=  max  (log  r)  (1  -  /  (1  -  G2^(x/r))  '  92n^^^ 

r>l  0 


Proof.  Since  0^  C  0  .  d  1  P*  We  now  prove  p  <  q.  For  this,  it  suffices 


to  show  the  following.  Given  (ej,82.**‘ .6|^)  in  0,  there  exists 
,0jj)  in  such  that 

k-1  0,.>  k-1  ef.x 

I  (log  p.(9,  ;n)  <  z  (log  )  p^.(9?.9*  »»■ 

j=l  ®(k-l)  J  1  2  k  J  1  2 


Let  0?  =  9J  =  •••  =  eT. 


'1  ^2 


k-1  "«(k-l)  ®k  =  ®(k)-  OKklously,  ef  - 


for  some  1  and  (9j,e|,*** ,9j^)  e  Gj.  Therefore, 
k-1  9/ki 

Z  (log  -^  ■)  p.(9,  .9^,««»,9.  ;n) 
j=l  ®(k-l)  J  1  2  k 

k-1 

=  Z  (log  r)  p.(9,.ep,***,9.  ;n) 
j=l  J  A  K 

*  (log  r)  (1  -  p,^(9j,82.*»*,0|^;n)) 


<  (log  r)  (1  -  J  '  G2j,(x/r))'^‘^ 

<  (log  r)  (1  -  p,^(9j,9|.***,0jj;n)) 

k-1 

=  (log  r)  z  p^(9?,95,***,9j;n) 
j=l  J  ^  ^ 

®fkl 

=  Z  (log  p.(9?,05,---,0?;n)  . 

j=l  ®(k-I)  J  1  2  k 

This  proves  that  p  ^  q  and  hence  p  =  q.  Incidentally,  if  9^  =  0|  =  • 
®k-l  ®k  “  '^®k-l  every  j  =  l,2,***,k-l. 


we  have 


••.ej-.r,)  .  /  (1  -  92„('>)  lx 

PJ^j 


=  /  (1  -  G2n(x))  (1  -  G2„(xr))  g2n(x)  dx. 


Pj(el.a|,. 
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Consequently , 


k-1 


«k 


max  z  (log -^)  p.(9?,eJ,***  ,ej;n)  , 

j=l  ®k-l  J  A  ^  K 

where  the  maximum  is  taken  over  all  (0|te|,‘ 


.ej^)  in  ej. 


=  max  (log  r)  (k-1)  /  (1  -  G,„(x))'^'^  (1  -  G,„(xr))  g,„(x)  dx 


r>l 


"2n 


2n' 


'2n' 


max  (log  r)  (1  -  P^(8 ,0 ,••• ,6 ,re ;n)) 


r>l 


.k-1 


=  max  (log  r)  (1  -  /  (1  -  G,  (x/r))  ‘  gp_(x)  dx), 
r^l  0 

This  completes  the  proof. 

As  a  consequence  of  this  lemma,  we  obtain  the  following  result. 
Theorem  3.7  For  every  n  ^  1 , 

max  L(0;n)  =  max  L{^;n) 


% 

0e0 


0C01 


k-2, 


P 


1. 


♦  *  “r* 


=  C(n)  +  Cg  max  (log  r)(k-l)  J  ’  (1  -  ^20^’'^ 


dx 


k-1 


=  C(n)  +  c^  max  (log  r)  (1  -  /  (1  -  G2n(x/r))  '  g2n(x)  dx) 


r>l 


Theorem  3.7  reduces  the  problem  of  maximizing  L  over  0  to  the 
problem  of  maximizing  the  following  'unction 

00 

f(r)  =  (log  r)  (1  -  ^  ~  92n^^^ 


r'  1 


over  all  r  >  1.  In  view  of  Theorem  3.7,  we  modify  the  notation  of  the  loss 


function  from  L(e2,62.***i0|^;n)  to 


L(r;n)  =  C(n)  +  f(r) , 

r  >  1  and  n  >  1, 


There  are  two  stages  involved  in  finding  the  minimax  sample  size. 

Stage  1.  For  every  fixed  n  ^  1,  maximize  L(r;n)  over  all  i"  i  1-  This 

is  equivalent  to  maximizing  f(r)  over  all  r  >  1.  Let  r  . 

^  n  f  lx 

be  the  value  at  which  f(r)  is  maximum. 

Stage  2  Then  find  the  minimum  of  L(r_  ,.;n)  over  all  n  >  1.  The  value  of 

fi  j 

n  at  which  L(r  ,  ;n)  is  minimum  is  the  required  minimax  sample 

n  f  K 

size. 

Maximization  of  f(r)  over  all  r  ^  1  is  not  easy  analytically. 

We  observe  that  f(*)  has  the  following  properties,  (i)  f(l)  =  0  (obvious) 

(ii)  lim  f(r)  =  0.  In  order  to  understand  the  function  f(*)»  we  used  a 
r-H" 

numerical  quadrature  formula  to  evaluate  f(r)  for  an  extensive  range  of 
values  of  r  and  k.  A  sample  of  these  findings  is  reproduced  below. 


Table  1  :  Tabulation  of  values  of  r_  the  value  at  which  f(*)  is  maximum 


Sample 

size 


No.  of  populations,  k 


The  following  information  emerges  from  these  studies,  (iii)  For 
every  fixed  n  and  k,  f(»)  seems  to  be  unimodal .  (iv)  For  every  fixed  k, 

r  .  decreases  with  increasing  n,  i.e.,  u  if  m  >  n.  (This 

property  useful  when  determining  minimax  sample  sizes.) 

Ofosu  (1972),  in  his  study  of  selecting  the  best  of  k  Gamma 
populations,  suggested  two  methods  of  determining  minimax  sample  sizes. 

Method  1  Choose  a  certain  range  of  plausible  values  of  sample  size  in  which 

we  hope  the  minimax  sample  size  lies.  Find  r  .  for  every  n 

n  f  K 

in  the  range  selected.  Evaluate  L(r  .  ;n)  for  every  n  in  the 

n  ^  In 

range.  Then  that  value  of  n  for  which  L(r  .  ;n)  is  minimum 

ll  j  In 

is  the  desired  mini max  sample  size. 

This  method  involved  a  lot  of  computer  time.  Determination  of 

plausible  minimax  values  of  n  involves  an  extensive  tabulation  of  L(r;n)'s. 

Further,  locating  r  .  's  is  a  time  consuming  process.  The  second  method 

n  j  ^ 

we  are  going  to  describe  now  is  a  slight  modification  of  another  method 
suggested  by  Ofosu  (1972).  This  method  uses  the  property  (iv)  mentioned  above. 

Method  2  k  is  fixed.  Find  r2  |^,  the  point  at  which  f(’)  is  maximum 

when  the  sample  size  is  2.  Evaluate  L(r;n)  for  r  =  1.0,  1.1,  1.2, 
•  ••  ,  r2  and  n  =  2,3,4,  •••  ,20.  These  values  are  tabulated 
in  the  two-way  grid,  with  rows  corresponding  to  r  and  columns 
corresponding  to  n.  For  each  column  in  the  two-way  grid,  locate 
the  maximum.  Under  each  column  representing  a  particular  sample 
size  n,  we  know  that  the  maximum  of  L(r;n)  occurs  at  some  value 
of  r  between  1.0  and  r^  |^.  See  property  (iv)  above.  These 
maxima  exhibit  a  decreasing  trend  to  start  with  and  then  rise 
steadily.  We  locate  the  column  for  which  the  maximum  entry  is 


minimum.  The  corresponding  sample  size  is  taken  to  be  the  mimmax 
sample  size.  If  these  column  maximums  exhibit  a  downward  trend  over 
the  entire  sample  range  from  2  to  20  considered,  then  the 
minimax  sample  size  is  ^  20.  In  this  case,  we  need  to  extend  the 
range  of  values  of  n.  We  may  need  to  use  asymptotic  expressions 
for  f(*)  for  large  values  of  n.  This  aspect  is  discussed  in  the 
last  section. 

We  found  that,  in  practice.  Method  2  works  faster  than  Method  1.  By 
way  of  illustration,  we  have  adopted  Method  2  to  find  the  minimax  sample 
size  for 

k  =  2, 3, 4, 5; 

Cg  »  0,  Cj  =  1  and  d  =  1;  and 

C2  *  1000. 

The  loss  function  L  is  given  by 

L(r;n)  =  kn  +  1000  (log  r)(l  -  ^  •  02p(x/r) 92n^^^  ' 

r  ^  1  and  n  ^  1.  A  sample  two-way  grid  is  given  on  the 
following  page  for  k  =  4.  From  the  computations  performed  using  Method  2, 
the  following  information  is  obtained. 

Table  2  :  Minimax  sample  sizes  under  Penalty  function  of  Type  2 


No.  of  populations  Minimax  sample  size  Minimax  loss 


£ 

!  15 

92.54 

1  16 

142.53 

i  18 

183.83 

;  14 

218.97 

r 


Penalty  function  of  Type  3  :  Penalties  which  take  into  account  differences 

between  the  best  and  the  rest  of  the  populations. 

Suppose  the  statistical  procedure  R  described  above  commits  the 
mistake  of  type  j  {j  =  l,2,-**,k-l).  It  is  natural  to  insist  that  the  penalty 
for  this  wrong  decision  should  depend  on  the  magnitudes  of  and 

Ofosu  (1972),  accordingly  introduced  the  following  penalty  functions. 


•«!<>  *  '“9  «(k)/»(J) 

for  all  (0j,e2,*-*  .0|^)  in  e  and  j  =  l,2,*«*,k-l,  where  a^.'s  are 

positive  constants  measured  in  the  same  units  as  those  of  c  and  c, .  The 

0  1 

loss  function  L  then  works  out  to  be 


L(02,02.**‘.0i^;n)  *  C(n)  +  z  a^dog  ®(|()/e(j))  Pj(ei  .02**  *  * 

3  ^ 

for  all  (0j^,e2»*** »0|^)  in  0  and  n^  1.  Ofosu  (1972)  asserts  that 


max  L(0;n)  =  max  L(0;n) 

0e0  0e0j^ 

that 

for  every  n  >  1  and.this  equality  can  be  verified  numerically  only!  He  has 

~  A 

no  analytical  proof  of  this  equality.  Then  he  went  on  to  obtain  the  minimax 
sample  size  for  selecting  the  best  of  k  Gamma  populations.  We  also  do  not 
have  any  analytical  proof  of  this  equality. We  abandon  the  project  of  working 
with  penalty  functions  of  this  type. 


Penalty  function  of  Type  4  :  Penalties  are  imposed  only  when  the  ratio  of 

the  parameter  values  of  the  best  population  and 
the  rest  exceed  a  prescribed  number. 


In  this  part,  we  are  going  to  introduce  --'a'  new,  natural  and  special 


penalty  function.  Let  0  <  5  <  1  be  given.  Let  e^,  62.  •••  .  be  a 

configuration  of  the  parameters  of  the  k  populations  ir^,  1:2.  •••  . 

respectively.  Suppose  the  statistical  procedure  R  declares  it.  to  be  the 

^  j 

best  for  some  j  =  l,2,*-*,k-l.  The  parameter  values  associated  with  the  best 

population  tt.  and  tt^.  are  and  respectively.  If  the  ratio 

k  j 

0^j)/0(l^)  (  1  1)  of  0^.^  and  0f|^^  is  close  to  unity,  we  would  not  like 


'(j) 


’(k) 


to  be  penalized  for  taking  the  wrong  decision  of  accepting  to  be  the 

J 

best  population.  On  the  other  hand,  if  the  ratio  ®(j)/®(k)  small,  we 

certainly  wish  to  be  penalized  for  accepting  tt.  to  be  the  best.  A  line  has 

j 

to  be  drawn  somewhere  between  the  statements  that  the  ratio  ®(j)^®(k) 
close  to  unity  and  that  ij;  is  being  small.  The  number  6  distinguishes  these 
two  statements  and  the  choice  of  6  is  subjective.  The  discussion  carried 
out  above  can  be  embodied  mathematically  in  the  following  way. 


'  0  ®(J)^®(k)  “  *’ 

for  all  (ej,02»*** f9|^)  In  0  and  j  =  l,2,***,k-l,  where  aj's  are 

positive  numbers  measured  in  the  same  units  as  those  of  c  and  c, .  To 

0  1 

emphasize  the  gravity  of  the  type  of  wrong  decision  taken,  one  may  wish  to 
have  1  ^2  -  -  ®k-l'  chain  of  inequalities  indicates  that 

the  more  extreme  the  type  of  wrong  decision  the  more  severe  the  penalty  is. 

We  deal  with  only  the  case  =  a2  =  •••  =  a|^_2  =  a,  say.  We  were  unable  to 
obtain  concrete  results  in  the  general  case  of  these  constants.  Under  this  case 
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the  loss  function  L  works  out  as  follows. 

Let  (0 j ^e2.•  •  •  f0|^)  in  Q  be  given.  Let  r  be  the  largest  index, 
if  it  exists,  in  {1 ,2,* •  •  ,k-l}  such  that  ®(r)'^®(k)  -  ^  implies 

that  ^(j)'^^(k)  —  ^  for  j  =  l,2,»**,r  and 

r+2,‘«*,k.  Then 

r 

L(ej,e2.-**  .0,^;n)  =  C(n)  +  a  z  Pj(ej,e2.*‘- .Qj^in). 

J  ^ 

If  there  is  no  index  r  satisfying  the  above,  then 
L(ej,92.**- .0|^;n)  =  C(n). 

The  index  r,  of  course,  depends  on  the  configuration  (ej,e2.**’ »e|^)  in  0. 

We  now  proceed  to  obtain  the  minimax  sample  size  n  under  the  loss 
function  given  above.  Towards  this  goal,  we  consider  the  following  subset  of  e. 

®2  ~  ^(01*02’"' e  0  ;  9j  =  @2  =  •  •  •  =  0^  and  =  9^^.2  ~ 

•••  =  0|(  ®  (l/0)0p  for  some 

r  in  {l,2,***,k-l}}. 


The  following  results  help  us  to  maximize  the  loss  function  L(0;n) 
over  all  e  e  0  for  every  fixed  n  >  1. 


Lemma  3.8  Let  (0j  ,02.*  *  *  .0|()  e  0  •  Suppose  there  exists  a  largest  index 
r  in  {1 ,2,«**  ,k-l}  such  that  ®(r)'^®(k) 


9*  =  0*  =  ...  =  0*  and  9*^^  =  0*^^  =  . 

where  0*  =  0(^)-  Then  (9j,0|,*** ,0j^)  e  ©2 


=  e*  =  (1/6)0*, 


’“k’  '  --  ’  k’ 

for  every  n  ^  1. 

Proof.  From  the  definition  of  02  *  ii^  is  clear  that  (9j^,0|t*  *  *  »0j^)  e  02* 
Observe  that 

p^(9j.6|,....ei;:n)  =  /  V  (1  -  G2„(^  x))  g2„(x)  dx 

=  /  (1  -  G2„(6x))''  (1  -  S2n(x))'‘"^‘''  g2n(x)  dx. 

9  /  j  \  ®  ( 1 1 

Note  that  -r^—^  <  S  for  every  i  =  l,2,***,r  and  - -  <  1  for 

®(k)  -  ®(k)  ■ 

every  i  =  r+1,  r+2,-**,  k.  Consequently, 

•  k-1 


«  r 


x))  n  (1  -  G,  x))  g.  (x)  dx 


=  /  n  (1  -  6-  X  )  n  1  -  G2_(^  X))  g2_( 

0  i*l  ®(k)  i=r+l  ®(k) 


>  /  (1  -  G2„(«x))''  (1  -  920^’'^ 


=  P|^(eJ,8|,***  ,ejj;n) . 


This  completes  the  proof. 


Lemma  3.9  The  following  equality  is  true  for  every  n  >  1. 

pjax  L(?;n)  =  max  L(§;n). 


Proof.  Let 


p  =  max  L(0;n)  and  q  =  jjjax  L(0;n).  Since  0»C  e,  q  £  p 
0e0  ee0, 


To  prove  the  reverse  inequality  p  £  q,  it  suffices  to  prove  the  following. 


Given  (0j,02.***  .0|^)  in  0  ,  there  exists  (0 J,0|,* •  •  ,0jj)  in  02  such  that 


k-1.  Then  L(0j,02.***  .9|^;n)  =  C(n)  +  0  =  C(n)  £  1(0^  *  *  .0jj;n) 


such  that 

0 


that 


(r) 


0 


(k) 


L(0j 

,0  2, 

0  i 

L(02 

.02,*** 

•®k’' 

in 

<  6  . 

Let 

®(k) 

<  6 

.  Let 

L(0j ,02.* ••  .0|^;n)  _<  L(0j,0|,***  ,0j^;n) , 


®(j) 


'(k) 


>  6  for  all  j  =  1,2,< 


=  0;  =  0(^)  and  =  0*^2 


0j^  *  (l/5)0^^j.  Obviously,  (0|,0|,“* ,0j^)  e  02*  Also, 

r 

L(0j,02.“*  >9|^;n)  =  C(n)  +  a  l^  Pj(ej,02.’**  .0|^;n) 

k-1 

<  C(n)  +  a  Z  p-(9,,0«,***»0u,n) 
j,l  J  1  ^  K 

“  C(n)  +  a(l  -  P|^ (9 2 ,02 » *  *  *  »0  in ) ) 


<  C(n)  +  a(l  -  p,^(9^,0|,***  ,9|^;n)),  by  Lemma  3.8 

k-1 

=  C(n)  +  a  E  p.(0f ,05,*** ,0j;n) 
j=l  J  ^  ^ 

=  L(9j,9|,‘**.0j^;n). 

This  completes  the  proof. 


The  above  lemma  simplifies  the  problem  of  maximizing  L(*;n)  over  © 


to  that  of  maximization  of  L(«:n)  over  Q^.  We  now  solve  the  problem  of 
maximization  of  L(*:n)  over  We  partition  as  follows.  Let 

©2^  "  ^  (0^ » ^2  •  *  *  *  ^  ^2*  ^1  "  ^2  "  •••  “  0j  and  *  ^j+2 

“  •••  “  ®k  ”  (l/6)6j^}  t 

for  j  =  1,2, ••• ,k-l. 

Lemma  3.10  The  following  statement  is  true.  If  (6j^  ,62»  *  *  *  e  ®2j  ’  then 

00 

P^(0j^f92****  (1  -  G2j^(x))^  ^  ®2n^*^  *** 

for  i  »  l,2,***,j  and  j  »  l,2,**»,k-l. 

Proof.  The  above  assertion  follows  directly  from  the  definition  of  the 
probabilities  and  Q2j'®* 

Theorem  3.11  For  every  fixed  n  ^  1, 

00 

^ax  L(9;n)  »  C(n)  +  a(l  -  /  (1  -  G_  (6x))^  ^  ®2n^*^ 

9e0  0 

00 

-  C(n)  +  a(k-l)  /  (1  -  G2jj(x/6))(1  -  ^20^*^^^^”^  ®2n^*^ 

Proof .  The  proof  is  carried  out  in  the  following  two  major  steps. 

1°.  In  view  of  Lemma  3.9,  we  maximize  L(0;n)  over  Q^.  As  a  first  step, 
we  maximize  L(*;n)  over  for  every  i  ”  l,2,***,k-l.  We  observe  that 

t\j  ^  (y, 

Igax  L(9;n)  =  C(n)  +  |gax  a  E  p  (9;n) 

9602^  9e02^  j-1  ^ 

00 

-  C(n)  +  ia  /  (1  -  G2^(x/6))*'"^  (1  -  G2jj(x))^‘^  g2jj(x)  dx, 
by  Lemma  3.10. 


The  maximum  of  L(6:n)  over  @2  k  1  another  equivalent  expression.  It  Is 

given  by 

jgax  L(9;n)  ■  C(n)  +  a  max  E  Pj(0;n) 


0e0 


2,k-l 


C(n)  +  a  ;gax  (1  -  p.  (0;n)) 

®^®2.k-l 


k-1 


C(n)  +  a(l  -  I  H  -  '■ 


‘2n' 


2°,  The  following  chain  of  inequalities  can  easily  be  checked  to  be  true. 
00  00 

/  (1  -  G2jj(x/6))'‘~^  *2n^*^  (1  -  ®2n^*^ 


i.  /  (1  -  G2^(x/5))’^"^  (1  -  G2jj(x))^  g2^(x)  dx  < 


_</(!-  G2^(x/6))  (1  -  G2jj(x))*^“^  *2n^*^ 


Since  02^,  i  ■  l,2,«*»,k-l  is  a  partition  of  02*  we  have 

00 

jgax  L(?;n)  -  max  {C(n)  +  ia  /  (1  -  G2  (x/6))*^”^  (1  -  G2jj(x))^~^  g2 

0e02  0 

CO 

-  C(n)  +  a(k-l)  /  (1  -  G2j^(x/6))(1  -  G2jj(x))'^”^  ®2n^*^ 


-  C(n)  +  a(l  -  /  (1  -  G2jj(<5x))'^"^  ®2n^*^ 


This  completes  the  proof. 


k.v. 


Is 
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The  above  theorem  explicitly  determines  the  maximum  of  L(d:n)  over 
all  ?  In  6.  This  maximum  Is  the  same  as  the  maximum  of  L(6;n)  over  all 
In  ®2  k  1*  called  "Least  Favourable  Choice  Set".  In  order  to  find  the 

minimax  sample  size,  we  have  to  solve  the  following  problem. 

Obj ectlve.  For  a  given  0  <  6  <  1,  minimize 


C(n)  +  a(l  -  /  (1  -  G2^(dx))’^‘^  g,„(x)  dx) 


»2n' 


over  all  n  >  1. 


Ue  obtain  the  mlnlmax  sample  sizes  for  £  >  0.5,  0.6,  0.7,  0.8, 0.9; 

k  =  2, 3, A, 5;  a  »  200;  c^  *  0,  Cj^  =  1  and  d  ■  1  by  solving  the  above  problem. 
These  sample  sizes  are  tabulated  below. 


Table  A  :  Mlnlmax  sample  sizes  under  Penalty  function  of  Type  A 


No.  of 
populations 

k 

6 

. 

0 

.5 

0 

.6 

0 

.7 

0 

.8 

Mlnlmax 

sample 

size 

Mlnlmax 

loss 

Mlnlmax 

sample 

size 

Mlnlmax 

loss 

Mlnlmax 

sample 

size 

Mlnlmax 

loss 

Mlnlmax 

sample 

size 

Mlnlmax 

loss 

2 

10 

32,95 

12 

A5.80 

12 

62. 8A 

8 

82.07 

3 

11 

53. lA 

12 

73. 3A 

11 

98.20 

6 

120.55 

A 

11 

71.68 

11 

97.23 

8 

12A.99 

3 

1A3.26 

5 

10 

88.89 

10 

118.10 

6 

1AA.7A 

2 

157.26 

6  -  0 

.9 

Minimax 

sample 

size 

Mlnlmax 

loss 

2  ^ 

2 

96.01 

3 

2 

131.87 

A 

2 

151. A6 

5 

2 

167.50 

mlnlmax  value  of  n 


In  Section  3,  we  obtained  mlnlmax  sample  sizes  under  penalty 
function  of  type  2  and  the  relevant  loss  function  Involved  Is 

00 

L(r;n)  -  kn  +  1000  (log  r)  (1  -  /  (1  -  G^^Cx/r))^”^  *2n^*^ 

for  r  i  1  and  n  ^  1.  (4.1) 

Using  Mehtod  2,  In  order  to  find  the  mlnlmax  sample  size  for  a  given  k, 

we  have  tabulated  the  values  of  L(r;n)  for  r  «  1.0,  l.l,***,r2  and 
n  >  1,2, •••,20.  Fortunately,  the  mlnlmax  sample  size  n  was  one  of  the 
numbers  2, 3, •••,20.  It  could  happen  that  the  mlnlmax  sample  size  n  is 
a  number  beyond  20.  The  constant  1000  appearing  In  the  above  loss  function 
played  a  crucial  role  In  keeping  mlnlmax  sample  sizes  to  a  moderate  level. 

If  we  replace  1000  by  a  larger  number,  mlnlmax  sample  sizes  do  Increase. 

To  Illustrate  this  point,  we  worked  out  the  mlnlmax  sample  sizes  under  the 
loss  function 

CD 

L(r;n)  -  kn  +  1200  (log  r)  (1  -  /  (1  -  G2^(x/r))  ’ 

for  r  _>  1  and  n  ^  1,  (4.2) 

and  these  are  tabulated  below. 

Table  5  :  Mlnlmax  sample  sizes  under  the  loss  functions  (4.1)  and  (4.2) 

No.  of  populations  Mlnlmax  sample  size  Mlnlmax  sample  size 
_  under  (4.1)  under  (4.2) 

2 

3 

4 

5 


L5 

L6 

L8 


If  we  increase  the  value  of  C2,  we  may  need  to  compute  the  value 
of  L(r;n)  for  n  >  20.  In  this  section,  we  approximate  the  integral  involved 
in  L(r;n)  by  an  integral  involving  the  standard  normal  distribution  for 
large  values  of  n.  The  integral  in  focus  is 

/  (1  -  G2n(x/r))‘^'^  g2„(x)  dx  .  (4.3) 

The  above  integral  can  be  obtained  probabilistically  as  follows.  Let  Yj,  Y2, 

Y|^  be  k  independent  identically  distributed  random  variables  each  having 
chi-square  distribution  with  2n  degrees  of  freedom.  Then 


Pi,(r;n)  »  Pr(Y,^  <  rY^,  j  *  1,2, -...k-l) 


.k-1 


=  /  (1  -  G2^(x/r))  *  92n(^) 


Employing  Wilson-HIl forty's  transformation,  we  define 

1/3  _ 

T 


(Y./2n)^''’  -  (1  -  l/9n) 

Zj  *  1  >  i  ®  l,2,»»»,k. 

'  (l/9n)f 


See  Kendall  and  Stuart  (1977,  p. 398-399).  If  n  is  large,  each  is  normally 
distributed  with  mean  zero  and  variance  unity.  We  note  that 


Yk  <  rYj  if  and  only  if  Zj^  <  +  a^^  , 


where 


n,r 


(1  -  l/9n)(r^^^  -  1) 


(l/9n)’ 

Consequently, 

P|^(r;n)  =  Pr(Y,^  ^  *  l,2,*“,k-l)  = 

*  Pr(Z^  <  +  a_,  j  »  1,2,..., k-l) 


■  ^  -  «(  ^  «P(-  <<*. 

where  '!'( •)  is  the  distribution  function  of  the  standard  normal  probability 
model.  Thus  for  large  n,  the  loss  function  works  out  to  be 

L(r;n)  =  kn  +  Cg^.log' r()(l  -‘ /(!-$( - 177^))*^'^  (2^f)"*  exp(-  x^/2)  dx) 

r 

for  r  >  1  and  n  >  1. 


The  integral  involved  in  the  loss  function  under  penalty  function 
of  type  4  is  given  by 

/  (1  -  92n^^^ 

As  before,  if  n  is  large,  this  integral  is  approximately  equal  to 


where  b„  , 
n  ,6 


/  (1  -  <t(5^/^{x  -  (2w)‘*  exp(-  x^/2)  dx. 


(1  -  l/9n)((l/5)^/^  -  1) 
(l/9n)* 


.  The  loss  function,  if  n  is  large. 


under  penalty  function  of  type  4  is  given  by 
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